No-Φ-Regret: A Connection between Computational Learning Theory and Game Theory
نویسندگان
چکیده
This paper explores a fundamental connection between computational learning theory and game theory through a property we call no-Φ-regret. Given a set of transformations Φ (i.e., mappings from actions to actions), a learning algorithm is said to exhibit no Φ-regret if an agent experiences no regret for playing the actions the algorithm prescribes, rather than playing the transformed actions prescribed by any of the elements of Φ. The existence of no-Φ-regret learning algorithms is established, for all finite Φ. Analogously, a class of game-theoretic equilibria, called ~ Φ-equilibria, for ~ Φ = (Φi)1≤i≤n, is defined (here n is the number of agents/players). The main contribution of this paper is to show that that the empirical distribution of play of no-Φi-regret algorithms converges to the set of ~ Φ-equilibria. The well-known result that the empirical distribution of play of nointernal-regret learning converges to the set of correlated equilibria follows as an immediate corollary of this general theorem. In addition to providing a sufficient condition, a necessary condition for convergence to the set of ~ Φ-equilibria is also derived. This work was originally motivated by an attempt to design a no-regret learning scheme that would converge to a tighter solution concept than the set of correlated equilibria. However, it is argued that the strongest form of no-Φ-regret learning is no-internal-regret learning. Hence, the tightest game-theoretic solution concept to which any no-Φ-regret algorithm converges is correlated equilibrium. In particular, Nash equilibrium is not a necessary outcome of learning via any no-Φ-regret algorithms.
منابع مشابه
A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria
A general class of no-regret learning algorithms, called no-Φ-regret learning algorithms, is defined which spans the spectrum from no-external-regret learning to no-internal-regret learning and beyond. The set Φ describes the set of strategies to which the play of a given learning algorithm is compared. A learning algorithm satisfies no-Φ-regret if no regret is experienced for playing as the al...
متن کاملRegret Minimization and Job Scheduling
Regret minimization has proven to be a very powerful tool in both computational learning theory and online algorithms. Regret minimization algorithms can guarantee, for a single decision maker, a near optimal behavior under fairly adversarial assumptions. I will discuss a recent extensions of the classical regret minimization model, which enable to handle many different settings related to job ...
متن کاملDistributed Computing with Adaptive Heuristics
We use ideas from distributed computing to study dynamic environments in which computational nodes, or decision makers, follow adaptive heuristics [16], i.e., simple and unsophisticated rules of behavior, e.g., repeatedly “best replying” to others’ actions, and minimizing “regret”, that have been extensively studied in game theory and economics. We explore when convergence of such simple dynami...
متن کاملAnalysis of the Evolutionary Game Theory in Agent-Based Computational Systems: OPEC Oil-Producing Countries
This study suggests a new method for analysing the behavioral economics issues in the framework of game theory. In this context, bounded rational agents interact with one another in a strategic manner. Therefore, conventional economic modeling techniques are unable to explaine this kind of interactions. In this regard, evolutionary game theory and agent-based modeling are known as the most suit...
متن کاملAdvanced Course in Machine Learning Spring 2010 Games Against Nature
In the previous lectures we talked about experts in different setups and analyzed the regret of the algorithm by comparing its performance to the performance of the best fixed experts (and later the best shifting expert). In this lecture we consider the game theory connection and present games against Nature. Along the way, we present one of the most common tools to analyze prediction problems:...
متن کامل